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DISCLAIMER 


This draft report to the Western Energy and Land Use Team 
contains the findings and opinions of the contractor team 
on WELUT Project 02 (implementation of a computer mapping 
system), and does not necessarily reflect the opinions of 
the Western Energy and Land Use Team or the personnel of the 
U.S. Fish and Wildlife Service, Region Six. 


This report is meant for use within the Fish and Wildlife 
Service only. Please do not circulate outside the Fish and 
Wildlife Service. 
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Report 1.2 


ABSTRACT 


This report is the last in a four part report titled 
User Needs Assessment. The other three volumes discussed the 
actual user needs in terms of both analysis and required data. 

They also discussed possible areas in Wyoming and Montana that could 
be used as test areas. (Appendix B gives the abstracts from the other 
three reports). 

Report 1.2 of the Task I Final Report User Needs Assess- 
ment presents a general system design and outlines a basic set of 
system capabilities for the U.S. Fish and Wildlife Service's Geo- 
graphic Information Systems. The system is to operate at two levels. 
One iS a manual system for cataloging maps, relevant articles, and 
other paper documents. The second level is the automated level. At 
the automated level, a computer based information system will allow 
for data input, data analysis, and information output based on user 


specified criteria. 
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1.0 


SYSTEM DESCRIPTION AND REQUIRED PERFORMANCE CRITERIA 


Introduction 


The goal of this project is to develop a coordinated manual 

and automated Geographic Information System (GIS) for the U.S. 
Fish and Wildlife Service. Consequently, the function of this 
report is to outline the general conceptual design of a GIS for 
FWS. A broad set of system capabilities are presented. This 
represents the first step in the design process. Later design 
steps will firm up and outline in exact detail all the functional 
characteristics of the system. To begin with, the system must 


have both a manual component and an automated component. 


The proposed system configuration will allow many users who have 
simple spatial problems to use a manual information system, while 


more complex spatial problems would utilize the more sophisticated 


power of the automated information system. The system design 

is necessarily based on the findings of a User Needs Assessment 
(Report Teel) 4) The requirements of the users serve to define the 
operational boundaries of the information system. It iS necessary 
only to fulfill the needs of FWS - not the entire world. The user 
needs also serve to define what are the relevant internal input 
analysis and display capabilities. These capabilities in turn are 
dependent on the data required by the users. Sunmarizing from 
Report 1.1, the FWS users group requires data of the following 


general characteristics: 


luser Needs Assessment for an Operational Geographic Information 
System Within the U.S. Fish & Wildlife Service 


(1) different data resolutions or geographical 
accuracy ranging from a 2 1/2 acre resolu- 
tion (1:10,000 scale map) necessary for site 
Specific planning and facility location to 
640 acre resolution (1:500,000 scale map) suit- 
able for macro region wide planning of an entire 
river basin or state. 


(2) different data currency or updating frequency 
ranging from the requirement for 3-6 month 
currency for corporate mineral development plans 
to 10 year currency for soils and geologic map data. 


(3) “di tterent geographical coverage or extensiveness of 


the study area ranging from a 200 square mile study 

area around a small water development project to an 

85,000 square mile study of the entire state of Wyo- 
ming. 


(4) differing data formats ranging from summary tabular 
reports to detailed 1:24,000 scale maps. 
The general system design presented in this report is based 
on data required (both existing and hoped for) by FWS personnel 


to fulfill the mandated objectives of the Fish & Wildlife Service. 


The discussion of the system is presented in three main sections. 
Tne first section (1.1) will give the system description in a 
very broad, non-technical format. The second section (1.2) dis- 
cusses the manual portions of the GIS. For those who are in- 
terested, the third section presents automated system description 


in finer detail both technically and conceptually (1.3). 


A | FWS GIS: Introduction and Overview 


Whether a eyeten is completely manual, completely automated, or 
a combination of both, it will maintain five primary functions: 
1) acquisition of data and its insertion into the data base, 2) 
maintenance of and retrieval from the data base, 3) analysis of 
data retrieved from the data base, 4) monitoring of the system, 


and 5) interaction between the user and the data base. 


Operating an information system is in many ways like operating a 
restaurant. If you think of data as food, the five functions are 
quite readily understood. To begin, food must be acquired (data 
acquisition). Usually it is easiest to purchase food from com- 
mercial sources (secondary data acquisition) provided that the 
needed items are available, the quality is acceptable, and the 
Supply is reliable. If the food is not available, then the more 
expensive, time consuming process of growing food must be considered 
(primary data collection). Generally, it is best to use readily 
available items, or at least have any necessary growing done under 


contract. 


Once acquired, the food has to be stored (data input). Before the 
food is stored, it must be checked for quality and freshness (input 
data editing). Items that are spoiled or imperfect must be discarded 


and replaced. 


Once the food is stored, the chef must be able to retrieve the food 


items as required by the demands (orders) of the customers (data 


users). At the same time, the chef keeps a close check that the 
food is not going bad, and replenishes his supply to meet the 


demands of his customers (data update). 


After the food is retrieved from Storage, the chef must prepare and 
cook the items to meet the specific demands of his customer (data 
analysis and data output). Ina restaurant, the degree of preparation 
and presentation may be minimal. A smorgasbord, for instance, may 
require only retrieval, cleaning, and layout of a selection of food. 
The degree of preparation, such as in a French cuisine restaurant, 

may be much more complex. In such instances, the patron may wish to 
Prepare his own meal. The point is that some customers wish only to 
get a quick peanut butter and jelly sandwich, while others may want 

a four course meal with fine wines, (served by a wine steward), after 


dinner liquers, and the service required of nobility. 


Critical to the existence and profitability of a high calibre res- 
taurant is the existence of demanding clients (users of the information 
system). In the first stages of establishing a restaurant, it is 
unwise to provide dishes that are too complex and too expensive for 

the client's taste. The restaurateur must decide whether to cater 


only to existing demands or whether gradually to introduce new dishes 
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with which the client can become familiar and eventually request. The 


restaurant management can do this by monitoring reactions of his clients 


to the dishes and listening to their suggestions and complaints. 


One point where the client and restaurateur interact is the menu 
(user interface). The client makes his selections from the menu. If 
the menu is lacking, the client will say so. He may possibly get 


up and leave the restaurant, never to return. 


The analogy between a restaurant and an information system should 

not be drawn too far, for the processes that make up an informative 
system are complex, segmented and closely interrelated. First, the 
manual and computer aided interface of the system will now be dis- 
cussed in greater detail. Then, a more detailed conceptual and techni- 


cal description of the FS GIS is presented. 


The AUTOMATED-MANUAL System Interface 


A computer based geographic information system must necessarily 
consist of two operational components: an automated portion and a 
manual portion. The automated portion consists of those system 
elements that are 1) amenable to computerization, 2) frequently used, 


and/or 3) require complex calculations for use by the user. The 


manual portion of the system consists of those system elements (such 
as map files, index card files, and field notes) which can be 


efficiently handled using manual techniques. The manual approach may 


be more desirable to implement than the automated system. 

The manual approach may be desired due to infrequent use of 
data due to the effectiveness of the present format due to the 
difficulty and cost of automation, or due to the fact that 


the present manual system effectively serves the needs of its users. 


Figure 1.1 presents a graphic representation of the automated- 
manual configuration of the system. Both the automated and 

manual portions of the system consist of two basic types of data: 
image and descriptor. Image data, also known as graphic or map 
data, range from uninterpreted aerial photographs to printed base 
maps. Image data may be tied to some geographic reference system, 
such as Latitude/Longitude, State Plane Coordinates, or Universal 
Transverse Mercator. Descriptor data, usually in the form of 
tables, text, or labels, may be maintained in the form of field 


notes, completed questionnaires, or card files. 


Descriptor data may be tied to image data through a geographic 
reference. For example, as shown in Figure 1.1, soil interpreta- 


tion reports can be tied geographically to soil survey maps. 


Image or descriptor may be either in manual form or automated 


(computer) form. Image data may be entirely devoid of descriptive 


information, such as a map of soils, polygons or uninterpreted 
aerial photographs. On the other hand, descriptor data may be 
entirely devoid of any geographic referencing scheme, such as soil 


interpretation guides or bibliographies. However, the majority of 
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the data of interest for the use in the FWS GIS contains both 


descriptor and image data in some logical relationship. 


Figure 1.1 shows two important relationships: 
(1) the automated-manual interface relationship, and 


(2) the degree of overlap between the automated and 
manual interfaces. 


The interface between the automated and manual portions of an 
information system has often been ignored, especially in the 
design stages of the system. This is a serious shortcoming, es- 
pecially when an automated system is going to be introducted into 


a previously manual environment. 


If the automated system is to maintain viability, the capability 
must exist to automate manual data when it is cost efficient to 

do so, and to provide the user with the ability to simply index 

manual data such as field notes, which are not cost efficient 

to automate. This amount of interface is depicted by the degree 
of overlap between the manual circles and the automated circles 


in Figure 1.1 As the system becomes more automated, the degree 


of overlap increases. A point of complete overlap comprises a 


totally automated system. 


The Manual System 


The manual map file system will store and jndex paper copy maps, 
tables, and charts. This will be accomplished through an organ- 
ized set of map file cabinets and a series of indexes. The in- 


dexes would exist in both an automated and manual form. 


eon | Methods of Indexing Data 


All data (maps, charts, tables, surveys) that cover a test 
area will be stored and indexed in the manual system (see 
Report 1.4 for case study discussion). These spatially 
related data will be indexed by the following data character- 
TSCLCS 

1) Location by USGS 7-1/2 minute quadrangle. This 
provides an inmediate tie to an index map. 

2) Variable type (soils, vegetative cover, land use) 

3) Format (maps, aerial photos, tables, charts) 

4) Resolution in terms of minimum area identified on 
the map or table. This could be defined by minimum 
acreage, size on the map, or minimum resolution of 
tabular data (e.g., county level census). 

5) Vintage which includes both the date of creation 
and later dates of updating. 

6) Physical location. Since this manual index will 
include data stored at the Billings area office, 
WELUT offices, Region Six offices, and other FWS 


offices, its physical location will be indicated. 


aes 


Physical Construction of the Manual System 


Four components make up the physical structure of this 


manual system. They include: 


1) A set of large map file cabinets to be maintained at the 
WELUT offices, at the Billings Area Office, and in the 
Denver Regional Offices. Maps are stored in these cabinets 
in an organized manner. Special arrangements are also re- 
quired for aerial photographs. 

2) A set of file cabinets for storage of descriptive data. 

These descriptive data would include such items as reports, 
U.S. Census data, wildlife survey data, and water quality 
data. Where applicable, the descriptive data will be cross- 
referenced with the image data (maps and aerial photographs). 

3) One card catalog index of all data in all three offices (BAO, 
WELUT and Denver). This card catalog, much like a library 
card index system would index each map, table or chart, by 
the six characteristics listed above in 1.2.1. An identical 
copy of this card index could be placed in every biologist's 
office. 

4) An automated file which contains the card catalog index. 

This automated file would be updated when new maps are 


located and stored in any office (Denver, WELUT, or BAO). 


This automated file could be printed out to produce new 
card index sets. The automated file can be sorted or sub- 
setted to deliver a specialized card index for selected 


topics or geographic regions. 
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The above components represent the general conceptual structure 
of a manual geographic information system. However, before the 
Frees system can be implemented, much more detailed speci fi- 
cations are required. These specifications would be centered 
upon: (1) the basic storage equipment, (2) document imput 

forms and document editing, sorting and output routing, (3) pro- 
tection from people damage, molds, and rodents, (4) back-up 


storage capabilities, (5) reproduction facilities, (6) updating, 


and (7) staffing and budgeting. 


The Automated System 


In order to effectively meet the geographic data needs of BAO, 

the Regional Offices, and OBS Special Projects, and automated Geo- 
graphic Information System is recommended to operate in parallel 
with the manual system. This Geographic Information System (GIS) 
must be able to serve the common needs of the user groups in a 
timely, accurate and cost effective manner while remaining compat- 


ible with existing manual systems of handling data. 


Geographic Information Systems is not a field by itself, but 
rather a common ground between information processing, geography, 
geology, biology, land use planning and the many other fields 
utilizing spatial analysis techniques. The boundary condition 


that separates GIS from other types of information systems is the 


Spatial nature of the data. 


dhs Sig Overall System Design 


Given this broad framework for Geographic Information Systems, 
the next phase in the system design process is a conceptual design 


of the proposed automated system. Such a design says nothing about 


exact processes, physical characteristics of hardware or software, 
or data characteristics; it is more the initial expression of the 
general systems capabilities, logical structure, and boundary con- 


ditions of the proposed system based on the present and future 


needs of the perceived users. 


The capabilities and structure of any GIS can be described in terms of 
Six major subsystems (Figure 1.2): 1) a data input subsystem, 2) a 

data base management subsystem, 3) a data display and analysis sub- 
system, 4) an information monitoring subsystem, 5) a user interface 
language processor subsystem, and 6) an implied administrative manage- 
ment subsystem. Given the needs of FWS users, five of the six subsystems 
were outlined in Figure 1.2. Except for the management subsystem, each 


of the five GIS subsystems are discussed below in terms of general 


characteristics. No statements are made regarding implementation methods 
or operational procedures. 


ee Information Monitoring and Use Subsystem 


The Information Monitoring and Use Subsystem is the point at which 


information produced by the systems is delivered to the user and is 


brought to bear on a problem of concern. The key to this subsystem is 


the monitor process which allows for users to express their satisfaction 
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Figure 1.2 Preliminary Geographic Information System Design 
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or dissatisfaction with the products and operation of the 
system. Feedback, sometimes referred to as the theory of the 
"screaming user," begins during the initial design process and 
continues during the life of the system. The feedback is used 
to correct weak points, add new capabilities, and delete un- 


used capabilities. 


The second monitor function is more internal to the system 
but again relates back to the users of the system. The internal 
monitor function acts as an accounting system and keeps track of 
several operational aspects of the system. Some of the aspects 
monitored might be: 

(1) how often a particular data base is accessed; 

(2) number of maps generated; 

(3) time required to perform a particular analysis; 

(4) an update file containing information on when 

data bases were updated; and 


(5) accounting by user i.d. codes. 


Internal monitoring is often referred to as tuning. The infor- 
mation provided by a tuning procedure can show where the system 
is too expensive to run, where bottlenecks occur, or what are 
the basic types of questions addressed to the system. It allows 
for the continual optimization and evolution of the system based 


on what the users are doing. 


eS Oe 


Data Analysis Subsystem 


The Data Analysis Subsystem serves two primary functions. 
First, it provides a basic set of techniques for doing 


spatial display & analysis of geographically referenced 


‘data. Second, it allows the GIS to interface with exist- 


ing and planned FWS modeling efforts. The specific spatial 
display and analysis techniques implemented depend on the 
types of analysis & display most commonly required by the 
different branches & projects within FWS (refer to report 
1.1), They can be grouped under three major headings: 

1) mapping & graphic display; 2) statistical analysis & 
report generation; 3) map overlay analysis, and 

4) modeling effots. These three analysis & display groups 
operate on the data retrieved from the data base given a 
set of user criteria. The analysis techniques themselves 
may also generate new data bases that can be later analyzed 
by the same or a different user. For example, correlation 
analysis generates a correlation matrix, which could be 
stored as a data base & then retrieved for later analysis. 
The three major analysis & display groups are now discussed. 
Mapping and Graphic Display 

Mapping & graphic display is the process of converting 

the automated image & descriptor data into graphic format. 
Mapping is the cartographic representation of the image 
data. Examples of map display would be contour maps, choro- 
pleth maps, or dot maps. Maps can be produced on a variety 
of output devices. The usual output devices are the line 
printer, digital plotter, cathode ray tube, & microfilm 
plotter. Figures 1.3 through 1.6 are examples of some 


common map displays. 
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FIGURE 123 
LINE PRINTER CONTOUR MAP 


adn od te deeds aka ACs LO ACO OCC ihe si RO IO.c LION AO OUOOO CI OOUCOOORUROUOboGOCGn. nr 


° 

e 

° 

° 

e 

° 

e 

35 6 666 ° 

Go SS bbe 66564 ° 
6 555 be 6onbnr 6 ° 
4 555 bt obobbnbb 3 
SS bhobASA 666 ° 
555 € 65065 6064 OO6H6 ° 
55 66646 Ss 4606066046 ° 
S55 64666 555 660666066 ° 
555 66€60 SESSS5S5S 66660666 64 5 
555 66 55€&555555 66006666 6666 A 
555 555£55555 666066 664 : 
25 9555556555 6666e i - 

6 555 55555555555 66666 666 66666 5 ° 
4g 55555555555555&5655 66660 6666666 5555 ° 
64a 55555&5555555555555 6666 TIT77777 6666666 5555 5 
464 5555555555£&5555 65666 VIALE 666666 5555 e 
444% 5555555555555 6666 TIITTITITFIIITTIT 6666 §§555 555 ® 
6446 55555556555 4KA6 7777 Tatar vat 60645 5555 e 
44464 55555555 466 uitfae 77777070 66666 e 
646646 555556 666 777? BHAA TITVITIII 6666666666466 66666 ° 
444644 555536 646 777 3AB6BBBSBA (T/7I7TI7770 6666666664666666 Z 
44444 55°55 666 77 HABA 88898388 WR RT 6 sPavaranr 5 

3 444444 5556 66 T77 #886 999 8Bd4 Wadi? VATE ° 
3 444446 $sse 656 77 6RH 999999999999 B9RBbHH (7777177 VI7777 6888 e 
3 4444646 5555 565 77 BR 99 9999 886888848 77777 SdABBRbSBB e 
3 644444 5555 56 77 KK 999 0000 ¥999 BB8are R8HBS9 e 
33 44444 55S 44 77 ak OY 0000U0UV00000 999999 2e8aseeess 99999999999 e 
333 4464 5555 6H 77 B& OY 000 J0vd9 999 8888 999 9999 e 
3333 664644 555 66 77 BR $99 000 0000 9999 9 0000 ° 
333 4466 Sts +666 77 BR 99 Hy BOBBED BLU. | Gist 9999999999 00 0000000" e 
333 4444 De) h6G6 77 = 8a 9S Yin eS $1411 uvdgu 9999 00 vo00000V000 ° 
333 444 555 646 Uh kee CA He) 1)) 2 111) 0090 au voou ° 
333 4444 55 66 77 BAR 99 YY Ml) 222222 1)1)1 000000000n0 ° 
333 4444 555 66 ae Wea) OO) Sah) 11) 22222222e2 ghdi1111 V1li)1 ° 
333 944 55 6€ 77 8s 99 OV 1} 222222 2é2e¢ VRP VPTAT LLL ° 
333 444 DS et 66) Pert Te BAGO is Ty 2222 d2¢2 1) 22222 2 ° 
33 444 5s CK ATA FEN Gy ie 2222 2672 22222e2222 3333333 ° 
33 444 S57 9666.97.70 NRSC a Ih 2222 3 c2e2 e222 33323 ° 
33 444 555 66 $77 RK QQ) gy 1) cée 33393 622222 e¢ep 333 944446 e 
33 444 Ss Se aE) SY ide 333333333 222222222 333 4444 555 ° 
33 444 4S 66 T be 99 90 le neice 33333333335 22222222 333 44 55555 ° 
3 446 55 isp dy 8 ini i 22 33 33333333 2222222 33 4% Ct) ° 
3 46> (55) ° 066 TT BR” Ge vag yi 22 333333333333 3300 44 (565 5 
3 444 Se) TKS ae IG) SW NN ee 333333333433 333 44 55985 e 
3 464 So) ROG B77 ee Sinn FO mIGOOn eli} 22 3333333333333333 3333 446 Se ° 
3 446 55 OST T SAP Sa) Von 11 222 «=3.3333333353533333 3333 446 «a5 ‘ 
33 444 55) OO 7. ARG GO 11) 222 333333 3333333333 3333 446 55 ° 
33 444 555 666 77 98 99 Wid By 22 33333 3343333333 3333 SS: ° 
33 44 555 SGN 7788" WSS a ei) 22 3333 33333333333 3333 64 5 7 - 
333 444 35 66 77 88 999 yO 11 222 333) 334393333343 333 4 55 66 7 e 
333 44 555 6£ 77 dk 99 UG PLY v2 333° 333433333333 3333 44 55 66 ° 
333 646 55 GO Tiss igi 9 ORUUON leuece 33°) 993333333333 3333 44 5 66 ° 
353 44 5545 €6 77 6A 999 yy Dio ide 3333335339333 3333 46 S 66 8 e 
SEIS) 46 DSS) 166 77 @B 9Y OU iileeee 3333353333 3333 414464 § 4606 Tatar ° 
3330 444 55 66 7F 888 49 Qun 3] 2222 3333233 3333 464% 55 6606 =77 ; 

22 33 444 55 666 77? AK Y¥S UA 1) 222 33333 64% 555 666 77 ° 
222 33 444 555 666 77 88 GG oc V1 222 332333 644 55555555 666 7 e 
222 33 446 55 66 77 BR 59 000 11) 2222¢2c222222 3333 4% 5555555555 666 777 e 
222 333 444 5§5 66 77 ABA 99 CO hp 22¢2¢2222222 3333 44 5959555555555 666 77 ) 
222 333° 444 £55 46 77 Abe 99 000 ity 2222 3339 46 =6555 ) 6§§556555555 666 ; 

11 22 333 444 56 60 77 eb 999 000 11) 22e 33 64 6&5 §55 5555555 666 Ma hires 
S| 222 33 44 65 66 77 6B 999 000 Yl1V421 eee 333 4q@ 5 55 555555555 066 ee 
111 22 333 44 55 4h 77 8HB 9999 "00 14112) 2222 33 444 555 555555555 666 5 

0 1) Grd ss itr SR (a5 afi 88 999 90N0 3h) 222 33 696 444 5555555 55 6666 . 
00 1 A) 2c 333 44 5S5 66 777 RR 999 00040 1111 2222 333 446444 5955555555555 6666.6 
000 WIT Vaze SS 446 SS G6: 07, 44B 9999 vdva0 hill 222 3334 444 555555555555 6666. 
00U 1) Lecce 38) 166 55 9666) 27 68886 999 v000 1.1 222 3333 464 35555555 666 » 

99 vu0 }))1 2e 23 464646 55 6 17 888 999 0u0 Y1q2 2222 31333 464 555 6666 . 
999 Hoy ew Cory 3G 55 66 TT777 888 9994 000 111 22 333 44 55 6666666666 ° 
999 090 IE ee is) 44 5S 66 7777 «©6888 9499 duu 11 22 333 444 55 66666 066606666 e 

a8 TIO Sue 111) ce 38 44 §55 666 77 8B 999 ou ] 22 333 4464 55 6666 666 5S. 
888 99) Vi) HOEMBR yA 33 446 55 666 an aBd 99 00 2 rc a 2 Y Se ee) T7777 66 5S. 


333, 4666 5 66 777777977777 66 Se 
34 446 85 6677TTTI7T7TINT77717 6 5 


7 95 910 1 2 44 $5 666 777 088 99 009 2 
2 
2 33 644 55 6 777777777 65S, 
2 


1 

M1 

h) 

\} 
444 55 66 Vaael 68 99 Yo J) 
33. 646 55 664 17 38 99 09 1} 
1) 

11 


333 46460=6«S5 666 7/7 B88 Y 4g 32 644 555 66 7717 6 5S be 

2 4455 77 i 1 44 5555 666 66 js 

44 55555 6666606666 ° 

444 35555 ° 

4444 ° 

4 44446444 44 ° 

Program CONTOURING: rea , 
Photographically reduced to 50% of printout size. Seip ? 


Pictorial results of a gravimetric Survey. 


17 


PLOTTER CHOROPLETH MAP 


FIGURE 1.4 
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Graphics are pictorial representations of the data such as bar 


graphs, charts, and line graphs. 


1.3.3.2 Statistical Analysis and Report Generation 


UP eae 


Statistical analysis and report generation is the process of summari- 
Zing data into a format that is usable by the GIS user. Statistical 
analysis can range from the simple calculations of means to sophisti- 
cated multi-variate techniques, such as discriminate analysis and 


factor analysis. While the GIS should be able to generate Simple 


Summary statistics, the majority of statistical analysis will 
probably de done external to the GIS. The GIS will construct 

a data matrix that could be input into such statistical packages 

as SPSS (Statistical Package for the Social Sciences) or BMD II 
(Biomedical Statistical Package). Systems such as SPSS provide 
extensive capabilities for statistical analysis, data - recoding 
transformation and scaling. Report generation also provides 
summary information. Reports are usually in tabular form. Figures 


1.7 and 1.8 are examples of Report Generation. 


Map Overlay Analysis: 


Map overlay (compositing) is the process of overlaying different 
maps for a given area. For example, to produce a habitat map for 
a given species, it might be necessary to overlay a vegetation map, 
a topographic map, and a species map. The composite or overlay 
process can be applied to cellular, polygon, or point data sets. 


Figures 1.9 and 1.10 show the overlay process for a cellular data set. 
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Modeling: 


Since this project is not oriented toward the construction of new 
specialized models, the ability must exist to reformat existing data 


subsets and feed these data into existing and planned biological or 


economic models. For example, many of the OBS Special projects are 


specifically geared toward model development and implementation. 
The GIS being designed must be able to interface with these models 


for both input and output. 


Data Base Management Subsystem 
The Data Base Management Subsystem (DBMS) maintains the geographically 


referenced data base. A data base may be defined as a collection of 


inter-related data stored together without harmful or unnecessary 


redundency. 


The data stored are as independent of the programs that use the 
data as possible. A common or controlled approach is used to 

add new data and modify and retrieve existing data within the data 
base. Figure 1.11 presents a basic set of objectives for a data 
base organization. These objectives are explicitly defined by 

the operating structure and analysis requirements of the potential 


USEVS 3: 


(de 


FIGURE 1.0) Objectives of Data Base Organization 


PRIMARY OBJECTIVES OF DATA BASE ORGANIZATION 


The Data Can Have Multiple Uses 
Different.users who perceive the same data differently can emp loy 
them in different ways. 


Intellectual Investment is Protected 

Existing programs and logical data structures (representing many 
man-years) will not have to be re-done when changes are made to 
the data base. 


Low Cost 
Low cost of storing and using data, and minimization of the high 
cost of making changes. 


Less Data Proliferation 

New application needs may be met with existing data rather fan 
creating new files, thus avoiding the excessive data proliferation 
in todays tape libraries. 


Performance 
Data requests can be satisfied with speed suitable to the usages 
of the data. 


Clarity 
Users can easily know and understand what data is available to them. 


Ease of Use 
Users can gain access to data in a simple fashion. Complexity is 
hidden from the users by the data base management system. 


Flexible Usage 
The data can be used or searched in flexible ways with different 
access paths. 


Unanticipated Requests for Data can Be Handled Quickly 
Spontaneous requests for data can be handled without application 
programs having to be written (a time consuming bottleneck), by 
means of high level query or report generation languages. 


Change is Easy 
The data base can grow and change without interferring with estab- 
lished ways of using the data. 


Accuracy and Consistency 

Accuracy controls will be used. The system will avoid having multi- 
ple versions of the same data items available to users in different 
stages of updating. 


Privacy 
Unauthorized access to the data will be prevented. The same data 
may be restricted in different ways from different users. 


26 
(Figure 1.11 - continued) 


- Protection from Loss or Damages 
Data will be protected from failures and catastrophes, and 
from criminals, vandals, incompetents, and persons who might 
falsely update them. 


- Availability 
Data are quickly available to users at almost all times when 
they need them. 


1 Martin, J. 1976 Computer Data Base Organization. John Wiley. 


3s 4.1 
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A data base management system designed with these objectives 


in mind, provides three basic forms of activity: 


(1) - data retrieval 


(2) - data updating 


(3) - data interface with other agency files. 


Data Retrieval 


Data retrieval is the process of extracting data from the data 


base given 


a set of criteria. The criteria set may be descriptor 


criteria, locational criteria, individual element criteria, or a 


combination of these criteria. Some examples of types of data 


extraction 


required of a GIS are: 


Locate a defined item - image and descriptor 
data (single data set) 


example: Locate the Fort Peck Reservoir in Montana 


Locate a set of defined items - image data and 
descriptor data 


example: Locate all reservoirs on the Three Forks, 
Wyoming, 1:250,000 scale map. 


Locate an item or set of items with criteria un- 
defined - image and descriptor data. 


example: Locate the largest reservoir in the state 
of Montana in terms of average water acreage 


Locate items based on defined relationships - image 
data and descriptor data. 


example: Locate all reservoirs within Larimer 
County within 1 mile of a dairy farm. 


Locate a set of items where criteria are another 
data set-image and descriptor data. 


example: Locate all reservoir sites in Larimer 
County that have non-porous soils. 


(ye ee ye 


Uses, 
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Data Updating 


Data updating allows the user to add, delete, and change 

data in the data base. For example, if a user discovers that 
the descriptive data describing a vegetation cover polygon is 
inaccurate, he should be able to delete the inaccurate infor- 
mation and add corrections. He should also be able, if re- 


quired, to delete the entire polygon and associated des- 


criptive information. Data updating allows for the integrity 

of the data bases to be maintained. Before an update procedure 

may be initiated, it will be necessary to specify a set of regu- 
lations governing the update process. These regulations provide 
a Standardized approach to the update process, protect the data 

bases from unauthorized change, and in general protect the data, 


as much as possible, from human damage. 


The Data Interface 


The data interface with other agencies will allow the FWS system 
to exchange data between FWS files and other state and federal 


agencies files. 


An example of this process might occur when FWS requires 
water quality data. The Environmental Protection Agency (EPA) 
maintains a large data base on water quality (STORET). The FWS 


GIS should be able to integrate EPA data into its own structure. 
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153.424 Data Structure 
These three basic forms of data management activity; 
data retrieval, data update, & data transfer, are tied to 
a common data structure. A common data structure is the 
way the data are logically & physically stored in the com- 
puter. The data structure design is based on the type, 
volume, & retrieval characteristics required by FWS. 
Presently, FWS geographic data requirements Suggest that 


the data structures will have to maintain: 
nominal data (census tract number) 
points (well locations) 
lines (streams) 
polygons (soil types) 


descriptive (data describing points, lines 
and polygons ) 


textual (indexes, bibliographies, etc.) 


The actual logical and physical storage structures have not 


been designed yet. 


The actual decisions on the physical Storage structure wil] 
depend on the types of data to be handled, the desired response 
Speed to a user query, and the volume of data to be handled. 

The volume of data to be processed is one of the more important 
factors. One of the characteristics of geographic data bases 

is very large data volumes. Volumes of 1,000,000 characters 

are not uncommon. Figure 1.12 provides some preliminary calcu- 
lations of data volumes for selected programs in BAO and the 
Regional Offices. These numbers represent the number of records 
for one variable using a cellular or grid coverage (cell 


volumes are easy to calculate). 


DATA VOLUME ESTIMATES FOR FWS* 


PROGRAM SCALE-AREA No. of RECORDS 
BAQ** 
Land Management T10:;,000 107,000 
Habitat Species 1:50,000 3 21,400 
Stream Survey *** P1000 500,000 
1; 24 ,000 20,833 
Utilization of 
Fish & Wildlife 1:25,000 161,280 
Geographic Ref- 
erencing 110,000 200,000 
1:24,000 83, 300 
Regional Mineral 1350),000 83,300 
1250000 40 ,000 
Stream & Structural 
Physical & Biological 2.5 acres 20 ,833 
Data Survey** 
Endangered Species 1:24,000 533, 300 
b255000 51,200 
Ownership & Manage- 
ment Plans 515.200 
Coal Coordination**** 
Habitat Species 1:50,000 806 , 400 
Regional Mineral 1250,000 806 , 400 
Ownership & Manage- 
ment 1:50,000 806,400 
Regional 
YAMPA 
Habitat Species 30 acre 143, 300 
Regional Mineral 30 acre 143,300 
Stream Structural 30 acre 143,300 


* The formula used was: Area/(Scale/1000) 
** BAO estimates for CMR only, which is @ 1.1 million acres 


***Streams data were considered to require half the coverage of other 
data; such as vegetation and soils. 


**x*Entire State of Wyoming 
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Data that are in the format of polygons could have 
smaller or larger data volumes for the same coverage 
depending on how many points are used to specify the 
boundary of the polygon. For most Purposes, the volume 
of data for polygons will be less than for cells. 
Volume is also an important consideration during the 


data input process. 


Data Input Subsystem 


The data input subsystem allows the user to transform 

a manual source document, such as a map, into machine 

readable form for entry into the data base. The data 

input subsystem is comprised of five basic manipulations: 
1) Data acquisition 


2) Image processing of remote sensor data 
and aerial photography 


3) Conversion to machine records 
4) Geocoding 


5) Data editing 


Data acquisition is the process of obtaining data in 

manual or automated form that fulfills the requirements 

for a particular project. The data acquisition function 
must deal with the data, their characteristics, the people 
and institutions that supply them, and their intended uses. 
If possible, the following characteristics should be 


determined for each data source before acquisition: 
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Data Characteristics 


location, ownership, and availability 
data format and volume 

- documentation availability 

- data classification systems 


Institutional Consideration 

- procedures for acquisition 

- continuity of data flow 

- agreements to supply data 

- committee of data suppliers 

- opportunities to change procedures 

- proprietary nature of data 

Evaluation of Data 

- applicability to problem 

- assessments of accuracy and reliability 
Many forms of source data are available as maps, tables, 
aerial photos and multi-spectral remote sensor imagery. 
These data may be primary data, which are unprocessed 
raw data, such as a field geologist's field maps, second- 
ary data, which are organized raw data such as the U.S. 


Census data, sample data, such as animal population 


surveys, or referencing data, such as the USGS graticules. 


In addition to acquisition, some data will require pro- 
cessing before conversion to computer format. Remote 
sensed data requires additional processing. This is 
called "image processing" and usually includes rectifica- 
tion and interpretation of air photos and rectification 


and classification of any satellite data. 
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Conversion to machine records is the actual process of converting the 
source documents into digital form and inserting the resultant data 

into the data base. Running in parallel with the conversion is the 
Process called geocoding. In the case of the FWS GIS, geocoding is 

the assignment of a geographic reference to each data item. This may 
involve a geographic transformation process. Geographic transformation 
allows for the conversion from various map projections, such as Trans- 
verse Mercator or Albers Conformal, into a common Projection. This 
greatly simplifies the structuring of the data base and facilitates 


retrieval. 


Input data editing is the process of correcting errors resulting 
from the conversion of source documents into machine readable form. 
Errors may arise due to human error, software problems, source 
document errors, or hardware errors. These errors could be in- 
correctly spelled names, misassignments of codes to polygons, in- 
accurate air photo interpretation, or inaccurate digitizing of 


cartographic documents. 


Some means must exist to. correct as many errors as possible at the 
point of data entry. Corrections of cartographic data after they 


are in the data base, can be very costly and time consuming. 


] 


ae 0 
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There are two primary considerations in the design of the input sub- 
system: 1) type of source data to be encoded; and 2) the volume of 
data to be encoded. Considering FWS needs, the GIS will be handling 
point, line, polygon, cellular (including remote sensor), and textual 
data. Below are listed the costs of converting these types of data 
from source format to computer format. These are only averages. 
Costs may be higher or lower depending on the particular character- 


istics of the source document. 


These costs are only for the actual encoding. Data preparation and 
data editing costs are not included. 

] Points - between .07 and .15 cents per point. 
2) Lines (arcs) (50 pts) .40 to.70 cents segment 


) 
) 
3) Polygon (50 pts) .50 cents to .80 per polygon 
\ “Grid .03 to: 10) cents per.cels 

) 


5) Textual .005 to .01 cents per character 


As can be seen, the cost range is great. Since these are only "ball park” 
figures, costs could be greater or lesser than those shown above. Data 


preparation and data editing could increase these costs 4 to 6 times. 


User Interface/Language Processor Subsystem 

The user interface/language processor subsystem allows the prospective - 
user to interface with the GIS. It also protects him from the internal 
complexities of the system. While the interface allows the user to 


interact with any portion of the GIS, it is primarily geared toward 


r 
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"Data Base Management" and the "Data Analysis Subsystems." 
The interface provides for 3 externally apparent services and 
One internal hidden service. The internal service is part of 
the tuning Process discussed in 1.3.2 and will not be discussed 
here. The external services are: 

== Tutorial ‘capabilities 

- Query capabilities 

- System security 
A Tutorial capability allows users unfamiliar with the system 
to be "lead by the hand" through the process. It represents 
a hierarchical structure: it allows the facile users to 
Suppress the tutorial, the moderate users to refresh their memory , 
and keeps naive users from having to run to a manual each time 
there is a problem. The query capability allows the user to 
Phrase a question in an easily understood format, called a 
query language. The user can thus easily manipulate data, re- 
trieve data, and display and analyze data. The tutorial function 
is important for maintaining a high technical standard in the 
system. Yet tutorials have usually been overlooked in other 


GIS development. 


System security prevents unauthorized users from entering the 
system, prevents unauthorized use of certain sections of the 
data base, and prevents a user from performing a function that 


could destroy data, cost too much, or degradate system integrity. 


Summa ry _ 

The function of this paper has been to outline the general 
conceptual design of a GIS for FWS. A broad set of system capa- 
bilities was presented. This represents the first major step 


in the design process. 
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The next steps will serve to state the system capabilities in much 
finer detail, especially at the technical level. The detailed system 
capabilities can be stated once a better grasp of data sources, data 
volumes, specific types of analysis, and sources of available (usable) 
computer software have been specified. These are the primary functions 
of Task 2 and Task 3 of this project. Basically, these functions 
will involve 1) detailed analysis of the characteristics of the 
required data, 2) analysis of the output requirements for the 

system, and 3) reflecting these first two analyses, the development 

of a very detailed system description. The detailed system description 


can then be used as the basis for software selection. 
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Appendix A 
GLOSSARY OF TERMS USED IN THIS REPORT 


Geography: Analysis of the patterns, structures, 
arrangements, and relationships of man's 
work on the surface of the earth (human 
geography) and of the patterns, structures 
and relationships of the earth itself 
(physical geography). 


Spatial: A key word used by geographers to indi- 
cate that they are interested in the 
relationships of phenomenon located in 
two and three dimensional space either in 
the abstract (the mathematics of point 
distributions) or on the ground (analysis 
of the habitats for various wildlife 


species). 
Geographic Explicit two and three dimensional loca- 
Reference: tors of a phenomenon, such as the Latitude- 


Longitude coordinates of a water well site. 


Geocode: The process of attaching a geographic 
reference to a phenomenon, such as assign- 
ing a Latitude-Longitude to a water well 


site. 

Spatial Analysis: Techniques of analyzing geographically 
referenced data. 

Spatial Display: Methods of visually depicting geographical ly 
referenced data (contour maps or choropleth 
maps ). 

Data: Data are raw numbers that convey little or 


no information in themselves. 


Information: Organization of data so that they are ina 
form useable for decision making. 


System: A system is a collection of entities and 
activities meaningfully connected and 
satisfactorily bounded which interact for 
a common purpose or purposes. 


(Appendix A - continued) 


Information 
Systems: 


Geographic 
Information 
System: 


Data Field: 


Record: 


File: 


Data Base: 


Data Base 
System: 


Hardware: 


Software: 


Data Base 


Management System: 


Data Input: 
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When information or data resources are 
joined together in both a formal and in- 
formal manner, they are considered to 
have some organized structure. They 
become an information system to support 
the decision making processes of an 
organization. 


An information system that can input, 
manipulate, and analyze geographically 
referenced data in order to support the 
decision making processes of an organi- 
zation. 


A data field is a collection of data items 
within a record. For example, a data 
field called DATE may be composed of data 
items MONTH, DAY, YEAR. 


A record is a named collection of data 
fields. For example, the Joseph Smith 
may be composed of the data fields 
EDUCATION, DATE and OCCUPATION. 


A file is a named collection of occur- 
rences of a given type of record. For 
example, a named file called EMPLOYEES 
may be composed of records for Joseph 
Smith, Peter Strong, and Harry Jones. 


A data base is a collection of the occur- 
rences of multiple record types, contain- 
ing relationships between records, data 
fields and data items. For example, 
personnel files are a data base. 


An organized collection of data bases. 
The actual physical equipment of a com- 
puter system (line printers, cabinets, 


teletypes, card readers, etc.). 


A set of instructions, either man generated 


or machine generated, that tell the computer 


what to do. 


A set of software that allows the user to 
access and change data in a data base. 


The process of transforming data from a 
manual form to a form that the computer 
can understand. 


39 


Appendix B 


ABSTRACT Report 1.1 


This report documents the results of a five month survey of 
user needs for a geographic information system witin Region Six of 
the U.S. Fish and Wildlife Service (FWS). This report culminates the 
first of seven project tasks being performed under contract to the U.S. 
Department of the Interior, U.S. Fish and Wildlife Service, Office of 
Biological Services, Western Energy and Land Use Team (WELUT). 

This project is defined and sponsored by the Western Energy 
and Land Use Team to promote more effective consideration of fish 
and wildlife values and resources in the numerous state and federal 
decisions concerning the rapid development of western land, energy, 
mineral and water resources. 

The goal of this two year project is to develop an operational 
capability within the Fish and Wildlife Service to accept, store, mani- 
pulate and display spatially related data for use in a variety of FWS 
wildlife resource impact projects. 

This task focused on the assessment of three groups of users: 
(1) the Denver Region Six Offices of FWS, (2) the Billings Area Office 
within Region Six, and (3) the Office of Biological Services Special 
Projects. Some twenty individual user offices were interviewed with 
structured questionnaires and follow-up sessions to identify their 
legislative mandates, programs, tasks, work elements, and specific 


spatial data needs. 
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Seven logical groups of data emerge as those which are required 
by multiple users. These generic data bundles or data compilation. 
modules include: 

1) Habitat-Species Distribution 
2) Stream Structural, Physical & Biological Survey Data 
3) Socio-Economic & Gross Land Use Change 


4) Seasonal Demand For and Utilization of 
Fish and Wildlife Resources 


5) Geographically Referenced FWS Land Ownership 
& Lease Data 


6) Regional Mineral & Energy Related Activity Pattern Data 


7) Ownership & Management Plans of Other Agencies 


Individual user groups will require the results of each of these 
compilation modules to differ by 1) spatial resolution (from 2 1/2 
acre polygons for site specific planning to 640 acre polygons for macro 
regional analysis), 2) data currency or updating frequency (from 3 
month currency to 10 year currency), and 3) geographical coverage 
(from a 200 square mile site specific problem to an 85,000 square mile 
Statewide analysis). 

Representative users and cases can be selected from this broad 
user community. Working on selected test cases for these few users 
will demonstrate the solution of specific problems which can be applied 


to the more general group of users. 
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ABSTRACT Report 1.3 


This appendix includes the forms used for the user needs 
assessment performed under contract to the U.S. Department of the 
Interior, U.S. Fish and Wildlife Service, Office of Biological 
Services. 

The forms in this report are the results of interviews 
with twenty groups of users in a series of introductory meetings. 


The groups interviewed consisted of the following: 


1) Ten Billings Area Office Groups 
2) Six Denver Region Office Groups 


3) Four OBS Special Project Groups 


Form #1 identifies the mandates, general programs, and staffing 
for each organizational element, Form #2 describes the work elements 
of each program, and Form #3 describes the detailed spatial data 
needs, processing, and outputs required for each work element. These 
forms start with general programs and successively move toward 


greater detail. 
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ABSTRACT Report 1.4 


The "User Needs Assessment for an Operational Geographic 
Information System" (Report 1.1) identified many FWS programs which 
would be benefitted by a GIS. To demonstrate the relevance of a GIS 
to these programs, seven candidate test areas are proposed (Figure 
1.1). These test areas are primarily meant to serve as prototype 
demonstrations of a GIS. However, to make the demonstrations 
relevant and meaningful to FWS users, the test areas were delineated 


to address real FWS problems whenever possible. 
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